PROBLEMS OF IDENTIFICATION

D.  P.  GAVER 
P.  A.  JACOBS 
I. G. O'MUIRCHEARTAIGH
A.  MELDRUM 

1. Formulations: Background and Introductory Comments

Envision  this  abstract  situation.  There  are  J  items,  distinct  from  one 
another  and  bearing  names.  Think  of  them  as  BIRDS  of  different  types.  Each 
one  is  characterized  by  a  vector  of  identifying  components  (you  can  possibly 
think  of  physical  characteristics  such  as  color,  flight  speed, 
characteristics of song, etc. as these parameters). In addition, it may (or
may  not)  be  known  where  the  items  are  located  geographically;  they 
occasionally  move,  and  may  move  together  in  suitable  flocks. 

Next,  there  is  a  group  of  individuals,  called  WATCHERS,  who  are 
sensitive to the parameters (physical characteristics) just mentioned, when
the  latter  become  evident.  In  effect,  some  bird  may  sing  his  song,  and  one 
or  more  of  the  WATCHERS  will  hear  and  record  various  features  of  the 
song, but with error; the same with other features or parameters.
Observations  are  made  "in  the  dark:"  observing  WATCHERS  cannot  see  the  BIRDS 
before  the  song  and  other  qualities  become  evident.  In  fact,  the  objective 
of  the  group  of  individuals  is  to  collectively  identify  the  BIRDS  in 
question  as  well  as  possible  just  by  comparing  notes  on  the  parameter 
announcements,  e.g.  song  and  other  feature  characteristics. 

Errors  of  various  types  are  easily  made,  depending  upon  the  operating 
characteristics  of  the  WATCHERS  and  on  the  distribution  of  the  parameters. 


Perhaps a group of BIRDS will all sing, and two WATCHERS will confuse two or
more  BIRDS  whose  songs  they  hear:  both  will  state  their  estimates  of  "the" 
song  length  of  what  they  take  to  be  a  single  BIRD,  whereas  in  fact,  they 
have  focused  on  two  different  BIRDS  with  similar-enough  songs. 

If  the  individual  assessments  are  error-prone  (as  they  will  be)  and  if 
the  distribution  of  the  vector  parameters  is  unfortunate,  being  tightly 
concentrated  around  a  point  in  p-space  (parameter  space)  the  advantage  of 
the  WATCHERS  is  minimal:  they  will  be  unable  to  accurately  discern  a 
particular  BIRD'S  presence,  much  less  how  many  BIRDS  are  singing.  If 
several  WATCHERS  are  responding  to  two  different  BIRDS,  their  composite 
single  assessment  of  the  parameter  may  fail  to  conform  to  anything  real. 

The  general  problem  is  to  identify  singing  BIRDS  using  error-prone  and  even 
gross  error  (outlier)  prone  observations. 

With  this  as  background  we  begin  to  formulate  a  variety  of  simple 
problems  and  to  consider  their  implications. 


actually is very near $\mu_j$, then we announce confidently that we have heard an
announcement by George the robin ("by George, I think I heard him"). Things
might not be quite so simple: the actual parameter announced may be
distributed somewhere near the value $\mu_j$. Increasing the spread or
variability of announced values of $\theta$ around $\mu_j$ will be confusing to each
WATCHER, and the announcement that we indeed heard George himself becomes
less likely to be true.


so WATCHER $i$ ($i = 1, 2, \ldots, I$) estimates the value of the parameter $\theta$ with
errors that are $N(0, \sigma_i^2)$. For the moment, assume that there is just the one
BIRD present. If all $I$ WATCHERS independently estimate $\theta$ and do so with
independently distributed errors, then it makes sense to write down and
examine the likelihood function

\[ L(\theta; x) = \prod_{i=1}^{I} f_{X_i}(x_i; \theta) \tag{2.4} \]
or, taking logs and concentrating on the Normal form,

\[ \ell(\theta; x) = \sum_{i=1}^{I} \ln f_{X_i}(x_i; \theta) = -\frac{1}{2} \sum_{i=1}^{I} \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \]

omitting irrelevant constants. Now with the $\sigma_i$ known the unrestricted
maximization of the likelihood produces the time-honored formula

\[ \hat\theta = \frac{\sum_{i=1}^{I} x_i / \sigma_i^2}{\sum_{i=1}^{I} 1 / \sigma_i^2} \tag{2.6} \]

i.e. the variance-weighted mean of the individual observations. The
variance of the estimate is

\[ \mathrm{Var}[\hat\theta] = \frac{1}{\sum_{i=1}^{I} 1 / \sigma_i^2}. \tag{2.7} \]

If all the above assumptions hold true, then one presumably compares $\hat\theta$ to
the "known" parameters of various BIRDS and picks BIRD $j^{\#}$, where
$|\hat\theta - \mu_{j^{\#}}| < |\hat\theta - \mu_j|$, $j \ne j^{\#}$; call this the nearest neighbor strategy, NN.

Because of symmetry, the solution $\hat\theta$ is also the mean of a Bayes posterior
with non-informative (flat, improper) prior. It is also the best linear
unbiased (BLUE) estimator of $\theta$, so it should be at least mildly satisfactory
to most non-rabid statisticians of any faith or persuasion. In the
important (oversimplified) case in which each individual bird sings
precisely on key, so the unknown $\theta = \mu_j$ for some $j$, the procedure yields
the maximum likelihood estimator of $\mu$ from the restricted parameter space
$\{\mu_1, \ldots, \mu_J\}$. See Hammersley (1950) for an early discussion of this
problem.
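
The variance-weighted mean (2.6), its variance (2.7), and the NN rule can be sketched in a few lines of Python. This is a minimal illustration, not the authors' program: the function names and the sample observations are hypothetical, and the BIRD parameters 1 to 5 are borrowed from the simulation section.

```python
def weighted_mean(xs, sigmas):
    """Variance-weighted mean (2.6) and its variance (2.7)."""
    precisions = [1.0 / s ** 2 for s in sigmas]
    total = sum(precisions)
    theta_hat = sum(x * p for x, p in zip(xs, precisions)) / total
    return theta_hat, 1.0 / total


def nearest_neighbor(theta_hat, mus):
    """Index of the BIRD parameter closest to theta_hat (the NN strategy)."""
    return min(range(len(mus)), key=lambda j: abs(theta_hat - mus[j]))


# Hypothetical data: three WATCHERS, the third one noisier than the others.
xs = [2.1, 1.8, 2.6]
sigmas = [0.5, 0.5, 1.0]
mus = [1, 2, 3, 4, 5]

theta_hat, var_hat = weighted_mean(xs, sigmas)
j_sharp = nearest_neighbor(theta_hat, mus)  # zero-based index into mus
```

The noisier third observation receives a quarter of the weight of the other two, so it pulls the estimate only slightly toward 2.6.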

2.2 A More Robust Likelihood

While many (perhaps transformed) measurement errors of physical
quantities are approximately Normal, especially "in the middle" of their
distribution, there can well be occasional outliers, in this case possibly
caused by individual mis-performance. In order to model this empirically
observed feature, it is becoming conventional to extend the tails of the
Normal in (2.3) in one of these ways:

(a) continuous scale mixing, where $\sigma_i^2$ is taken to be a conveniently
distributed (e.g. inverse Gamma) random variable;

(b) $\epsilon$-contamination, in which usually $1 - \epsilon_i$ is close to one and $\sigma_{i1}$ is
relatively small ($\sigma_{i1} < \sigma_{i2}$), while $\epsilon_i > 0$ is close to zero, but $\sigma_{i2}$ is
large, e.g. $\sigma_{i2} = 10\sigma_{i1}$. This model was utilized in a classical
robustness context by Tukey, and also by Berger and Berliner
(1983) for Bayesian robustness purposes.

Begin by discussing (a). This approach can (but need not) lead to
replacement of the normal observation density by a Student t form:

\[ f_{X_i}(x_i; \theta) = \frac{C(d_i)}{\left[ 1 + \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \frac{1}{d_i} \right]^{(d_i + 1)/2}} \cdot \frac{1}{\sigma_i} \tag{2.9} \]

Here view $d_i$ as a shape tuning parameter; $\mathrm{Var}[X_i] = \sigma_i^2 d_i / (d_i - 2)$ if $d_i > 2$, but
kurtosis (fourth central moment scaled to be dimension-free) can induce very
extended tails, simulating outlier occurrence. If $d_i = 1$, we get the
centered and scaled Cauchy, with notoriously long, symmetric tails. The
Cauchy tails are so long that neither mean nor variance, nor any other
moment, exists. The likelihood obtained by combining individual measures
is now


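\[ L(\theta; x) = \prod_{i=1}^{I} \frac{C(d_i)}{\left[ 1 + \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \frac{1}{d_i} \right]^{(d_i + 1)/2}} \cdot \frac{1}{\sigma_i} \tag{2.10} \]

and so, up to irrelevant constants,

\[ \ln L(\theta; x) = \ell(\theta; x) = -\sum_{i=1}^{I} \left( \frac{d_i + 1}{2} \right) \ln\left[ 1 + \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \frac{1}{d_i} \right]. \]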


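Now differentiation with respect to $\theta$ gives

\[ \frac{\partial \ell}{\partial \theta} = \sum_{i=1}^{I} \left( \frac{d_i + 1}{2} \right) \frac{2 (x_i - \theta) / (\sigma_i^2 d_i)}{1 + \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \frac{1}{d_i}} = 0 \tag{2.11} \]

as a condition for a maximizing $\theta$, denoted by $\hat\theta$. In principle this equation
could have more than one solution; Copas has discussed this situation.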

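To obtain a usually sensible (optimal) solution, proceed as follows:

Iterative Reweighting

Rewrite (2.11) as follows:

\[ 0 = \sum_{i=1}^{I} \left( x_i - \theta^{(r+1)} \right) \frac{1}{\sigma_i^2}\, w_i^{(r)} \tag{2.12} \]

so that

\[ \theta^{(r+1)} = \frac{\sum_{i=1}^{I} (x_i / \sigma_i^2)\, w_i^{(r)}}{\sum_{i=1}^{I} (1 / \sigma_i^2)\, w_i^{(r)}} \tag{2.13} \]

where the weight

\[ w_i^{(r)} = \frac{(d_i + 1)/d_i}{1 + \left( \frac{x_i - \theta^{(r)}}{\sigma_i} \right)^2 \frac{1}{d_i}}. \tag{2.14} \]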
One might start the iteration at

\[ \theta^{(1)} = \mathrm{median}(x_1, x_2, \ldots, x_I) \tag{2.15} \]

and then compute the first weight

\[ w_i^{(1)} = \frac{(d_i + 1)/d_i}{1 + \left( \frac{x_i - \theta^{(1)}}{\sigma_i} \right)^2 \frac{1}{d_i}} \tag{2.16} \]


and use this to find the second estimate $\theta^{(2)}$. Even the first iteration, as
described, will be quite successful at taming down individual widely
discrepant values, or "outliers". The smaller $d_i$ is ($d_i \ge 1$, presumably),
the more effectively discrepant values are reduced in influence.

After obtaining convergence, one may apply the NN approach to identify
$j^{\#}$, the name (number) of the BIRD actually singing.
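
The reweighting scheme (2.12)-(2.16) is short enough to sketch directly. The following is a minimal illustration under stated assumptions: the function name, the stopping tolerance, the iteration cap, and the sample data are all hypothetical choices, not part of the original report.

```python
import statistics


def student_t_irls(xs, sigmas, dfs, tol=1e-10, max_iter=200):
    """Iteratively reweighted estimate of theta under Student t errors."""
    theta = statistics.median(xs)  # starting value, as in (2.15)
    weights = [1.0] * len(xs)
    for _ in range(max_iter):
        # Weights (2.14): observations far from theta are down-weighted.
        weights = [((d + 1.0) / d) / (1.0 + ((x - theta) / s) ** 2 / d)
                   for x, s, d in zip(xs, sigmas, dfs)]
        num = sum(w * x / s ** 2 for w, x, s in zip(weights, xs, sigmas))
        den = sum(w / s ** 2 for w, s in zip(weights, sigmas))
        new_theta = num / den  # update (2.13)
        converged = abs(new_theta - theta) < tol
        theta = new_theta
        if converged:
            break
    return theta, weights


# Hypothetical data: three consistent WATCHERS and one gross outlier.
xs = [2.0, 2.2, 1.9, 9.0]
sigmas = [0.5, 0.5, 0.5, 0.5]
dfs = [1, 1, 1, 1]  # d = 1 gives the Cauchy case

theta_hat, weights = student_t_irls(xs, sigmas, dfs)
```

With these inputs the outlying value 9.0 ends up with a weight far below the others, so the estimate stays close to the three consistent observations.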

The above procedure will usually work satisfactorily, but may err
because of an unfortunate starting estimate. If each BIRD sings almost
precisely on key, so the unknown $\theta = \mu_j$ for some $j$, then a precise maximum
likelihood solution can be obtained by simple enumeration: one simply
evaluates (2.10) for $\theta \in \{\mu_1, \mu_2, \ldots, \mu_J\}$ and picks the $j = j^{\#}$ that gives the
maximum. For small $J$ this is computationally feasible and provides the
truly maximum likelihood solution given the problem formulation. On the
other hand, the weights produced by the iterative solution provide a
convenient index as to the relative importance to be attached to the data
values by the iterative procedure, so it seems sensible to become a "weight
watcher". The pattern of weights might suggest reasons for relative comfort
or discomfort with an identification: for instance, a relatively uniform
distribution of low weights perhaps gives discomfort, while mostly high
weights with a few very low ones thrown in may give reason for comfort,
presumably with a consensus of the high-weighted values. One fact that
should be noted is that the likelihood (2.10) may well have
multiple peaks or modes, and the primary one is presumably usually found by
the re-weighted iteration NN scheme suggested. In any case, the parameter
space point-by-point enumeration is often feasible.
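
Enumeration over the candidate parameters can be sketched as follows. This is an illustration only: the function names and data are hypothetical, and the log of (2.10) is evaluated up to terms that do not depend on $\theta$.

```python
import math


def student_t_loglik(theta, xs, sigmas, dfs):
    """Log of the Student t likelihood (2.10), dropping theta-free constants."""
    return -sum(0.5 * (d + 1.0) * math.log1p(((x - theta) / s) ** 2 / d)
                for x, s, d in zip(xs, sigmas, dfs))


def enumerate_mle(xs, sigmas, dfs, mus):
    """Evaluate the log likelihood at each BIRD parameter; return the best index."""
    scores = [student_t_loglik(mu, xs, sigmas, dfs) for mu in mus]
    best = max(range(len(mus)), key=lambda j: scores[j])
    return best, scores


# Hypothetical data, including one gross outlier.
xs = [2.0, 2.2, 1.9, 9.0]
sigmas = [0.5] * 4
dfs = [1] * 4
mus = [1, 2, 3, 4, 5]

j_sharp, scores = enumerate_mle(xs, sigmas, dfs, mus)  # zero-based index
```

Because every candidate is scored, this approach cannot be trapped at a secondary mode, at a cost of $J$ likelihood evaluations.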

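Next discuss the $\epsilon$-contamination model (b). Unfortunately the
likelihood is of an awkward form

\[ L(\theta; x) = \prod_{i=1}^{I} \left[ (1 - \epsilon_i) \exp\left[ -\frac{1}{2} \left( \frac{x_i - \theta}{\sigma_{i1}} \right)^2 \right] \frac{1}{\sqrt{2\pi}\, \sigma_{i1}} + \epsilon_i \exp\left[ -\frac{1}{2} \left( \frac{x_i - \theta}{\sigma_{i2}} \right)^2 \right] \frac{1}{\sqrt{2\pi}\, \sigma_{i2}} \right]. \tag{2.17} \]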

Now it can be seen that if the multiplication is carried out and some
rearrangement is done, we can express the likelihood as

\[ L(\theta; x) = \sum_{k=1}^{K} \pi_k(x) \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_k(x)}{\sigma_k(x)} \right)^2 \right] \frac{1}{\sqrt{2\pi}\, \sigma_k(x)} \tag{2.18} \]

where

\[ \pi_k(x) = \pi_k \exp\left\{ -\frac{1}{2} R_k(x) \right\} \tag{2.19} \]

and $K = 2^I$. It turns out that $\mu_k(x)$ is a linearly weighted function of the
individual observations, and $1/\sigma_k^2(x)$ is a corresponding sum of inverse
variances; $R_k(x)$ measures the discrepancy of the individual observations
from $\mu_k(x)$.

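For illustration, suppose $I = 2$, so, up to multiplicative constants,

\[ L(\theta; x) = \prod_{i=1}^{2} \left[ (1 - \epsilon_i) \frac{1}{\sqrt{2\pi}\, \sigma_{i1}} \exp\left\{ -\frac{1}{2} \left( \frac{x_i - \theta}{\sigma_{i1}} \right)^2 \right\} + \epsilon_i \frac{1}{\sqrt{2\pi}\, \sigma_{i2}} \exp\left\{ -\frac{1}{2} \left( \frac{x_i - \theta}{\sigma_{i2}} \right)^2 \right\} \right] \]

\[ = \sum_{k=1}^{4} \pi_k(x) \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_k(x)}{\sigma_k(x)} \right)^2 \right] \frac{1}{\sqrt{2\pi}\, \sigma_k(x)} \]

where $\{\pi_k\} = \{(1 - \epsilon_1)(1 - \epsilon_2),\ (1 - \epsilon_1)\epsilon_2,\ \epsilon_1 (1 - \epsilon_2),\ \epsilon_1 \epsilon_2\}$; the terms $\mu_k(x)$ and $\sigma_k(x)$ are
obtained by completing the square in the exponent of each summand.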
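
The expansion of the product (2.17) into $K = 2^I$ normal-shaped terms can be checked numerically. The sketch below is an illustration under assumptions: the function names and data are hypothetical, and each component weight here carries the mixing product together with the constant that arises from completing the square, so it plays the role of $\pi_k(x)$ in (2.18) with all multiplicative constants kept explicit.

```python
import math
from itertools import product


def normal_pdf(x, mean, sd):
    return math.exp(-0.5 * ((x - mean) / sd) ** 2) / (math.sqrt(2 * math.pi) * sd)


def cn_likelihood(theta, xs, eps, sd_good, sd_bad):
    """Direct product form of the contaminated-normal likelihood (2.17)."""
    out = 1.0
    for x, e, s1, s2 in zip(xs, eps, sd_good, sd_bad):
        out *= (1 - e) * normal_pdf(x, theta, s1) + e * normal_pdf(x, theta, s2)
    return out


def cn_components(xs, eps, sd_good, sd_bad):
    """Expand the product into 2**I components (weight, mu_k, sigma_k)."""
    comps = []
    for labels in product((0, 1), repeat=len(xs)):  # 0 = good, 1 = outlier
        sds = [s2 if lab else s1 for lab, s1, s2 in zip(labels, sd_good, sd_bad)]
        mix = math.prod(e if lab else 1 - e for lab, e in zip(labels, eps))
        precision = sum(1 / s ** 2 for s in sds)        # 1 / sigma_k(x)^2
        sd_k = precision ** -0.5
        mu_k = sum(x / s ** 2 for x, s in zip(xs, sds)) / precision
        r_k = sum(((x - mu_k) / s) ** 2 for x, s in zip(xs, sds))  # discrepancy
        const = (math.sqrt(2 * math.pi) * sd_k
                 / math.prod(math.sqrt(2 * math.pi) * s for s in sds))
        comps.append((mix * const * math.exp(-0.5 * r_k), mu_k, sd_k))
    return comps


# Hypothetical two-WATCHER example.
xs = [2.0, 2.6]
eps = [0.1, 0.25]
sd_good = [0.5, 0.5]
sd_bad = [5.0, 5.0]
comps = cn_components(xs, eps, sd_good, sd_bad)


def expanded_likelihood(theta):
    return sum(w * normal_pdf(theta, mu, sd) for w, mu, sd in comps)
```

Evaluating `cn_likelihood` and `expanded_likelihood` at the same $\theta$ gives matching values, which is the content of the rearrangement leading to (2.18).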

The form of (2.18) suggests that $L(\theta; x)$ is a possibly multimodal
function, as was true of the Student t likelihood. An iterative scheme can
be set up as detailed below to estimate $\theta$ and the NN approach can then be
taken. If each BIRD sings almost precisely on key, so the unknown $\theta = \mu_j$ for
some $j$, then a precise maximum likelihood solution can be obtained by simple
enumeration as before.
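Iterative Reweighting

Taking logarithms of (2.17) and differentiating with respect to $\theta$
results in the equation

\[ \frac{\partial \ell}{\partial \theta} = 0 = \sum_{i=1}^{I} \left( x_i - \theta^{(r+1)} \right) \left[ \left( \frac{1}{\sigma_{i2}} \right)^2 + w_i^{(r)} \left[ \left( \frac{1}{\sigma_{i1}} \right)^2 - \left( \frac{1}{\sigma_{i2}} \right)^2 \right] \right] \tag{2.20} \]

where the weight

\[ w_i^{(r)} = \frac{A_i^{(r)}}{\epsilon_i + A_i^{(r)}} \tag{2.21} \]

with

\[ A_i^{(r)} = (1 - \epsilon_i) \frac{\sigma_{i2}}{\sigma_{i1}} \exp\left\{ -\frac{1}{2} \left( x_i - \theta^{(r)} \right)^2 \left[ \left( \frac{1}{\sigma_{i1}} \right)^2 - \left( \frac{1}{\sigma_{i2}} \right)^2 \right] \right\}. \tag{2.22} \]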


As for the Student t distribution, one might start the iteration at
$\theta^{(1)} = \mathrm{median}(x_1, x_2, \ldots, x_I)$ and then compute the first weight; use the weights
to find the second estimate; etc.
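
The contaminated-normal iteration (2.20)-(2.22) can be sketched in the same way as the Student t version. This is a hedged illustration: the function name, tolerance, iteration cap, and data are hypothetical. The weight of each observation is the conditional probability that it came from the narrow (uncontaminated) component.

```python
import math
import statistics


def cn_irls(xs, eps, sd_good, sd_bad, tol=1e-10, max_iter=200):
    """Iteratively reweighted estimate of theta, contaminated-normal errors."""
    theta = statistics.median(xs)  # starting value
    weights = [1.0] * len(xs)
    for _ in range(max_iter):
        weights, coefs = [], []
        for x, e, s1, s2 in zip(xs, eps, sd_good, sd_bad):
            gap = 1.0 / s1 ** 2 - 1.0 / s2 ** 2
            a = (1 - e) * (s2 / s1) * math.exp(-0.5 * (x - theta) ** 2 * gap)  # (2.22)
            w = a / (e + a)                                                    # (2.21)
            weights.append(w)
            coefs.append(1.0 / s2 ** 2 + w * gap)  # bracketed factor in (2.20)
        # Solving (2.20) for theta^(r+1) gives a coefficient-weighted mean.
        new_theta = sum(c * x for c, x in zip(coefs, xs)) / sum(coefs)
        converged = abs(new_theta - theta) < tol
        theta = new_theta
        if converged:
            break
    return theta, weights


# Hypothetical data: three consistent WATCHERS and one gross outlier.
xs = [2.0, 2.2, 1.9, 9.0]
eps = [0.1] * 4
sd_good = [0.5] * 4
sd_bad = [5.0] * 4

theta_hat, weights = cn_irls(xs, eps, sd_good, sd_bad)
```

An observation with weight near zero is treated almost entirely as a draw from the wide component and contributes with precision $1/\sigma_{i2}^2$ only.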


3.  Bayesian  Formulations;  Everything  Normal 


In addition to the information on $\theta$ coming from individual $i$ there
may be information on $\theta$ codable in the form of a probability density:
$p_\theta(\theta)$. The latter may actually take the form of a series of near delta
functions, one for each of the BIRDS in question. For the sake of a bit of
generality write

\[ p_\theta(\theta) = \sum_{j=1}^{J} P_j \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_j}{\tau_j} \right)^2 \right] \frac{1}{\sqrt{2\pi}\, \tau_j}. \tag{3.1} \]

Here possibly $P_j = 1/J$, where $J$ represents the number of BIRDS believed to be
in the vicinity and of interest. If $\tau_j \to 0$ then the above indeed represents
a "Dirac comb" with teeth at the points $\mu_j$, $j = 1, 2, \ldots, J$; the sharpness of
the teeth is dictated by $\tau_j$: small $\tau_j$ means that the $j$th tooth (density) is
long (tall) and sharp.

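Now by routine Bayes we get for the posterior density

\[ p_{\theta|x}(\theta) = K \prod_{i=1}^{I} f_{X_i}(x_i; \theta)\, p_\theta(\theta) \tag{3.2} \]

and, if we adopt the Normal model,

\[ p_{\theta|x}(\theta) = K \prod_{i=1}^{I} f_{X_i}(x_i; \theta)\, p_\theta(\theta) = K \exp\left[ -\frac{1}{2} \sum_{i=1}^{I} \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \right] \cdot \sum_{j=1}^{J} \frac{P_j}{\tau_j} \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_j}{\tau_j} \right)^2 \right], \tag{3.3} \]

with factors that depend on neither $\theta$ nor $j$ absorbed into $K$.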


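This can be simplified: write

\[ \left( \frac{\theta - \mu_j(x)}{\tau_j(x)} \right)^2 + Q_j = \left( \frac{\theta - \mu_j}{\tau_j} \right)^2 + \sum_{i=1}^{I} \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \tag{3.4} \]

where $Q_j$ doesn't depend on $\theta$, and look for $\mu_j(x)$ and $\tau_j(x)$ in terms of the
observations and parameters; to do so differentiate with respect to $\theta$ to
obtain

\[ 2 \left( \frac{\theta - \mu_j(x)}{\tau_j^2(x)} \right) = 2 \left( \frac{\theta - \mu_j}{\tau_j^2} \right) + 2 \sum_{i=1}^{I} \left( \frac{\theta - x_i}{\sigma_i^2} \right). \tag{3.5} \]

Since the coefficients of $\theta$ and 1 on each side of the equation must match,

\[ \frac{1}{\tau_j^2(x)} = \frac{1}{\tau_j^2} + \sum_{i=1}^{I} \frac{1}{\sigma_i^2} \tag{3.6} \]

\[ \mu_j(x) = \left[ \frac{\mu_j}{\tau_j^2} + \sum_{i=1}^{I} \frac{x_i}{\sigma_i^2} \right] \tau_j^2(x). \tag{3.7} \]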


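To identify $Q_j$ let $\theta = \mu_j(x)$ in (3.4); then

\[ Q_j = \left( \frac{\mu_j(x) - \mu_j}{\tau_j} \right)^2 + \sum_{i=1}^{I} \left( \frac{x_i - \mu_j(x)}{\sigma_i} \right)^2; \tag{3.8} \]

that is, $Q_j$ is the scaled sum of squared deviations of (a) the $j$th posterior
mean from the prior mean and (b) the $j$th posterior mean from each
individual observation. Now return to (3.3) and substitute:

\[ p_{\theta|x}(\theta) = K \sum_{j=1}^{J} \frac{P_j}{\tau_j} \exp\left[ -\frac{1}{2} Q_j \right] \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_j(x)}{\tau_j(x)} \right)^2 \right]. \tag{3.9} \]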


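By normalization,

\[ 1 = \int_{-\infty}^{\infty} p_{\theta|x}(\theta)\, d\theta = K \sum_{j=1}^{J} \frac{P_j}{\tau_j} \exp\left[ -\frac{1}{2} Q_j \right] \int_{-\infty}^{\infty} \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_j(x)}{\tau_j(x)} \right)^2 \right] d\theta = K \sum_{j=1}^{J} \frac{P_j}{\tau_j} \exp\left[ -\frac{1}{2} Q_j \right] \sqrt{2\pi}\, \tau_j(x). \tag{3.10} \]

Thus the posterior density is of the form

\[ p_{\theta|x}(\theta) = \sum_{j=1}^{J} P_j'(x) \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_j(x)}{\tau_j(x)} \right)^2 \right] \frac{1}{\sqrt{2\pi}\, \tau_j(x)} \tag{3.11} \]

where

\[ P_j'(x) = \frac{P_j\, e^{-\frac{1}{2} Q_j}\, \tau_j(x) / \tau_j}{\sum_{k=1}^{J} P_k\, e^{-\frac{1}{2} Q_k}\, \tau_k(x) / \tau_k}. \tag{3.12} \]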


In other words by completing the square in (3.4), the resulting form of the
posterior density (3.12) and the prior density (3.1) resemble each other
closely. For the important special case in which $\tau_j^2 = 0$, $j = 1, 2, \ldots, J$, one
obtains the discrete distribution concentrated at the values $\mu_j$
($j = 1, 2, \ldots, J$) having probability mass function

\[ P\{\theta = \mu_j \mid x\} = P_j'(x) = \frac{P_j \exp\left[ -\frac{1}{2} \sum_{i=1}^{I} (x_i - \mu_j)^2 / \sigma_i^2 \right]}{\sum_{k=1}^{J} P_k \exp\left[ -\frac{1}{2} \sum_{i=1}^{I} (x_i - \mu_k)^2 / \sigma_i^2 \right]}. \tag{3.13} \]

The component probabilities $P_j$ are very simply modified in accordance with
the observations, and can be easily updated as more observations become
available. One could hope that after a set of observations has become
available then, say,

$P_3'(x) \approx 1$

and

$P_j'(x) \approx 0$ for $j \ne 3$,

which points strongly at BIRD 3 as being the one that is actually singing.
If, on the other hand, all $P_j'$-values were to remain similar it might be
thought that some individuals have focused on two or more different items,
or that the noise is not well represented by the normal (Gaussian) model.
This possibility is not included in the present model, however. Note, too,
that if $\{P_j = 1/J,\ j = 1, 2, \ldots, J\}$, the discrete uniform distribution,
then naming by picking the maximum probability from (3.13) is exactly
equivalent to maximizing the likelihood by direct enumeration.
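
The normal-theory updating of (3.6)-(3.8), (3.12), and the on-key special case (3.13) can be sketched as follows. This is an illustration under assumptions: function names and data are hypothetical, the prior components are taken to be normalized normal densities (hence the factor $\tau_j(x)/\tau_j$ in the weights), all prior probabilities are assumed strictly positive, and the computation is done on the log scale for numerical stability.

```python
import math


def _normalize(log_weights):
    top = max(log_weights)
    w = [math.exp(lw - top) for lw in log_weights]
    total = sum(w)
    return [wi / total for wi in w]


def normal_mixture_posterior(xs, sigmas, mus, taus, priors):
    """Updated probabilities (3.12) with posterior means and sds per BIRD."""
    data_precision = sum(1.0 / s ** 2 for s in sigmas)
    data_sum = sum(x / s ** 2 for x, s in zip(xs, sigmas))
    log_w, means, sds = [], [], []
    for mu, tau, p in zip(mus, taus, priors):
        tau_x2 = 1.0 / (1.0 / tau ** 2 + data_precision)       # (3.6)
        mu_x = (mu / tau ** 2 + data_sum) * tau_x2              # (3.7)
        q = ((mu_x - mu) / tau) ** 2 + sum(
            ((x - mu_x) / s) ** 2 for x, s in zip(xs, sigmas))  # (3.8)
        log_w.append(math.log(p) - 0.5 * q
                     + 0.5 * math.log(tau_x2) - math.log(tau))
        means.append(mu_x)
        sds.append(math.sqrt(tau_x2))
    return _normalize(log_w), means, sds


def on_key_posterior(xs, sigmas, mus, priors):
    """Probability mass function (3.13) for BIRDS that sing exactly on key."""
    log_w = [math.log(p) - 0.5 * sum(((x - mu) / s) ** 2
                                     for x, s in zip(xs, sigmas))
             for mu, p in zip(mus, priors)]
    return _normalize(log_w)


# Hypothetical data with the equally likely prior.
xs = [2.1, 1.8, 2.6]
sigmas = [0.5, 0.5, 0.5]
mus = [1, 2, 3, 4, 5]
priors = [0.2] * 5

exact = on_key_posterior(xs, sigmas, mus, priors)
near, post_means, post_sds = normal_mixture_posterior(
    xs, sigmas, mus, [1e-3] * 5, priors)
```

With very small $\tau_j$ the mixture posterior probabilities approach the on-key values, which is the limiting statement made for (3.13).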
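3.1 Bayesian Formulations; (1) Student t Observations

Let the prior information on BIRDS be given by (3.1). Individual
watchers independently observe $\theta$ with errors distributed according to the
Student t family:

\[ f_{X_i}(x_i; \theta) = \frac{C(d_i)}{\left[ 1 + \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \frac{1}{d_i} \right]^{(d_i + 1)/2}} \cdot \frac{1}{\sigma_i}. \tag{3.14} \]

Then the posterior density is of the form

\[ p_{\theta|x}(\theta) = K \prod_{i=1}^{I} f_{X_i}(x_i; \theta) \sum_{j=1}^{J} \frac{P_j}{\tau_j} \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_j}{\tau_j} \right)^2 \right] \]

\[ = K' \sum_{j=1}^{J} \frac{P_j}{\tau_j} \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_j}{\tau_j} \right)^2 \right] \exp\left[ -\frac{1}{2} \sum_{i=1}^{I} (d_i + 1) \ln\left[ 1 + \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \frac{1}{d_i} \right] \right]. \tag{3.15} \]
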

In order to normalize this expression (determine $K'$), and to compute moments
(for point estimates, the NN approach, etc.) it is necessary to integrate
over all $\theta$-values; of course no simple analytic closed form expression
exists. There are two practical options:

(a) numerical integration, using Gauss-Hermite integration, e.g., by
adopting the program of Naylor and Smith (1982); or
(b) analytical approximation, using a variant of the Laplace method,
see de Bruijn (1958) or the equivalent; this classical approach has
been invoked by Mosteller and Wallace (1964), Gaver (1985),
Tierney and Kadane (1986), Lindley and Singpurwalla (1986), and by
many others as well.

To apply Laplace, write

\[ p_{\theta|x}(\theta) = K \sum_{j=1}^{J} \frac{P_j}{\tau_j}\, e^{-\frac{1}{2} S_j} \tag{3.16} \]

where

\[ S_j = \left( \frac{\theta - \mu_j}{\tau_j} \right)^2 + \sum_{i=1}^{I} (d_i + 1) \ln\left[ 1 + \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \frac{1}{d_i} \right]. \tag{3.17} \]

The plan is to replace $S_j$ by an approximating quadratic in $\theta$, and thus to
exhibit closed-form approximating expressions for the updated probabilities
$P_j^*(x)$ that are quite analogous to the "exact" formulas (3.12) obtainable
under normal/Gaussian error specifications.

To proceed, assume that the exponent is of the form

\[ S_j \approx \left( \frac{\theta - \mu_j(x)}{\tau_j(x)} \right)^2 + Q_j(x) \tag{3.18} \]

where $Q_j(x)$ is at least nearly independent of $\theta$. Now differentiate the two
forms on $\theta$:

\[ 2 \left( \frac{\theta - \mu_j(x)}{\tau_j^2(x)} \right) = 2 \left( \frac{\theta - \mu_j}{\tau_j^2} \right) + \sum_{i=1}^{I} 2 \left( \frac{d_i + 1}{d_i} \right) \frac{(\theta - x_i) / \sigma_i^2}{1 + \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \frac{1}{d_i}}. \tag{3.19} \]


Now identify the coefficients of $\theta$ and 1 to see that

\[ \frac{1}{\tau_j^2(x)} = \frac{1}{\tau_j^2} + \sum_{i=1}^{I} \frac{1}{\sigma_i^2}\, w_i \tag{3.20} \]

and

\[ \mu_j(x) = \tau_j^2(x) \left[ \frac{\mu_j}{\tau_j^2} + \sum_{i=1}^{I} \frac{x_i}{\sigma_i^2}\, w_i \right] \tag{3.21} \]

where the weights are

\[ w_i = \frac{(d_i + 1)/d_i}{1 + \left( \frac{x_i - \theta}{\sigma_i} \right)^2 \frac{1}{d_i}}. \tag{3.22} \]

In practice, it will be necessary to estimate $\theta$ by an iterative re-weighting
procedure, so approximate weights will be used:

\[ \hat w_i = \frac{(d_i + 1)/d_i}{1 + \left( \frac{x_i - \hat\theta}{\sigma_i} \right)^2 \frac{1}{d_i}}. \tag{3.23} \]


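Now replace $S_j$ in (3.16) by the quadratic approximation to find

\[ p_{\theta|x}(\theta) \approx K \sum_{j=1}^{J} \frac{P_j}{\tau_j} \exp\left[ -\frac{1}{2} Q_j(x) \right] \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_j(x)}{\tau_j(x)} \right)^2 \right] \tag{3.24} \]

where the approximate prior probability revision factor is from (3.18)

\[ Q_j(x) = \left( \frac{\mu_j(x) - \mu_j}{\tau_j} \right)^2 + \sum_{i=1}^{I} (d_i + 1) \ln\left[ 1 + \left( \frac{x_i - \mu_j(x)}{\sigma_i} \right)^2 \frac{1}{d_i} \right] \tag{3.25} \]

and so

\[ P_j^*(x) = \frac{P_j\, e^{-\frac{1}{2} Q_j(x)}\, \tau_j(x) / \tau_j}{\sum_{k=1}^{J} P_k\, e^{-\frac{1}{2} Q_k(x)}\, \tau_k(x) / \tau_k} \tag{3.26} \]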

provides the approximate data-updated probability that BIRD $j$ is singing.
It is reasonable to designate $j = j^*$ if $P^*_{j^*}(x) > P^*_j(x)$ for $j \ne j^*$. The form of
the Bayes posterior is of course quite analogous to that derived for the
normal errors case. Here we have

\[ p_{\theta|x}(\theta) \approx \sum_{j=1}^{J} P_j^*(x) \exp\left[ -\frac{1}{2} \left( \frac{\theta - \mu_j(x)}{\tau_j(x)} \right)^2 \right] \frac{1}{\sqrt{2\pi}\, \tau_j(x)} \tag{3.27} \]

with the squared-error-minimizing point estimate, i.e., the expected value
of $\theta$ given observations $x$, being

\[ \hat\theta = \sum_{j=1}^{J} P_j^*(x)\, \mu_j(x). \]


It is this value that should apparently appear in the weights, $\hat w_i$, in the
course of the iterative calculation.

The behavior of $Q_j(x)$, and hence of the prior to posterior revision
$P_j \to P_j^*(x)$, seems intuitively appealing: first, one is led to estimate the
most likely characteristic of the song of the $j$th BIRD, $\mu_j(x)$, by combining
data $x_i$, $i = 1, 2, \ldots, I$ (using the knowledge that outliers will occur, so
the estimate is made robustly) and prior information about the variability
of BIRD $j$'s song. Then this estimate is effectively compared to (1) the
candidate true mean value of the $j$th BIRD's song, $\mu_j$, and (2) the data obtained
by the listeners; both (1) and (2) measured on an appropriate scale of
variability. If the sum of these distances (squared and scaled) is small,
the conditional probability of $j$ being the songster is correspondingly
increased; otherwise, it is reduced.
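
One way the approximate updating of (3.20)-(3.26) might be iterated is sketched below. This is an illustration under assumptions, not the authors' program: the function name, the fixed iteration count, and the data are hypothetical, the prior components are taken as normalized normal densities (so the revision factor carries $\tau_j(x)/\tau_j$), and the weights are recomputed at the current posterior-mean estimate of $\theta$.

```python
import math
import statistics


def laplace_t_posterior(xs, sigmas, dfs, mus, taus, priors, n_iter=50):
    """Approximate BIRD probabilities for Student t errors, normal-mixture prior."""
    theta = statistics.median(xs)  # starting value for the weights
    probs, mu_x = [], []
    for _ in range(n_iter):
        w = [((d + 1.0) / d) / (1.0 + ((x - theta) / s) ** 2 / d)
             for x, s, d in zip(xs, sigmas, dfs)]                   # (3.23)
        w_prec = sum(wi / s ** 2 for wi, s in zip(w, sigmas))
        w_sum = sum(wi * x / s ** 2 for wi, x, s in zip(w, xs, sigmas))
        log_p, mu_x = [], []
        for mu, tau, p in zip(mus, taus, priors):
            tau_x2 = 1.0 / (1.0 / tau ** 2 + w_prec)                # (3.20)
            m = tau_x2 * (mu / tau ** 2 + w_sum)                    # (3.21)
            q = ((m - mu) / tau) ** 2 + sum(
                (d + 1.0) * math.log1p(((x - m) / s) ** 2 / d)
                for x, s, d in zip(xs, sigmas, dfs))                # (3.25)
            log_p.append(math.log(p) - 0.5 * q
                         + 0.5 * math.log(tau_x2) - math.log(tau))
            mu_x.append(m)
        top = max(log_p)
        unnorm = [math.exp(lp - top) for lp in log_p]
        total = sum(unnorm)
        probs = [u / total for u in unnorm]                         # (3.26)
        theta = sum(pj * mj for pj, mj in zip(probs, mu_x))         # posterior mean
    return probs, mu_x, theta


# Hypothetical data: one gross outlier among four WATCHERS.
xs = [2.0, 2.2, 1.9, 9.0]
sigmas = [0.5] * 4
dfs = [1] * 4
mus = [1, 2, 3, 4, 5]
taus = [0.1] * 5
priors = [0.2] * 5

probs, post_means, theta_hat = laplace_t_posterior(xs, sigmas, dfs, mus, taus, priors)
```

A fixed number of passes is used here for simplicity; a convergence test on successive values of the point estimate would serve equally well.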

If $\tau_j = 0$, $j = 1, 2, \ldots, J$, so the BIRDS always sing precisely on key, then
the above density becomes a probability mass function:

\[ P_j^*(x) = P\{\theta = \mu_j \mid x\} = K\, P_j \prod_{i=1}^{I} \frac{C(d_i)}{\left[ 1 + \left( \frac{x_i - \mu_j}{\sigma_i} \right)^2 \frac{1}{d_i} \right]^{(d_i + 1)/2}} \]

where the requirement that the $P_j^*(x)$ sum to one
determines $K$. To identify the optimal $j = j^*$, simply locate the maximum $P_j^*$.

4.  Results  of  Simulation  Experiments 

This  section  reports  some  of  the  results  of  simulation  experiments  to 
study  the  performance  of  various  methods  of  combining  WATCHER  observations 
to  obtain  an  estimate  of  the  parameter  of  the  singing  BIRD.  All  simulations 
were  carried  out  on  an  IBM  3033AP  at  the  Naval  Postgraduate  School.  Random 
numbers  were  generated  using  IMSL  routines.  Some  details  and  results  of  the 
simulations  are  given  below;  for  more  see  Meldrum  [1986]. 

4.1 BIRDS with Univariate Parameters

There are 5 BIRDS with parameters $\{\mu_j\}$ equal to 1, 2, 3, 4 and 5. The
BIRD that sings has parameter $\mu_j$ with probability $p_j$.

The number of WATCHERS varies between 2 and 5. The observation of the
$i$th WATCHER is

\[ X_i = \theta + E_i \tag{4.1} \]

where $\theta$ is the parameter of the BIRD that sings and $E_i$ is the observational
error. The distributions of observational error considered are:

1) the normal distribution with mean 0 and standard deviation $\sigma = 0.5$
(e.g. (2.3));

2) the $\epsilon$-contaminated normal (2.8) with mean 0, standard deviations
$\sigma_1 = 0.5$ and $\sigma_2 = 5$, and contamination probability $\epsilon = 0.1$ and 0.25; CN($\epsilon$);

3) the Student t distribution (2.9) with $\sigma = 0.5$ and $d = 1$, which is
the Cauchy distribution.

Each simulation case has 10,000 replications. In each replication the
BIRD with parameter $\mu_j$ was drawn to sing with probability $p_j$ and WATCHER
observations were generated. The following estimates $\hat\theta$ of the parameter of
the singing BIRD were computed:

1. the mean of the observations;

2. the median of the observations;

3. the iterative Student-t estimate (2.12)-(2.16) with assumed values
$\sigma = 0.5$ and $d = 1$ and 10;

4. the iterative $\epsilon$-contaminated normal estimate (2.20)-(2.22) with
assumed parameter values $\sigma_1 = 0.5$, $\sigma_2 = 5.0$ and $\epsilon = 0.1$ and 0.25.

In each case, the BIRD whose parameter was closest to the estimated $\theta$
was estimated to be the BIRD that sang.

In each replication Bayes procedures for combining WATCHER observations
were also considered. The prior probability of the BIRD with parameter $\mu_j$
singing was assumed to be 1/5 in each case (equally likely prior). The
assumed error distributions for the Bayes models were as follows:

1. normal with mean 0 and standard deviation $\sigma = 0.5$;

2. Student t with $\sigma = 0.5$ and degrees of freedom $d = 1$ and 10;

3. the $\epsilon$-contaminated normal with $\sigma_1 = 0.5$, $\sigma_2 = 5$ and $\epsilon = 0.1$ and
0.25.

For each assumed error distribution the posterior probability of the
singing BIRD having parameter $\mu_j$ was computed.

The BIRD whose parameter had the largest posterior probability was the
estimate of the BIRD that sang.
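
A small-scale version of one cell of such an experiment can be sketched with the Python standard library. This is not the original simulation (which used IMSL routines on an IBM 3033AP): the function names, the seed, and the reduced replication count are hypothetical, and only the mean and median NN procedures are compared, under contaminated-normal errors.

```python
import random
import statistics

MUS = [1, 2, 3, 4, 5]  # BIRD parameters from Section 4.1


def draw_error(rng, eps, sd_good=0.5, sd_bad=5.0):
    """One contaminated-normal error: wide component with probability eps."""
    sd = sd_bad if rng.random() < eps else sd_good
    return rng.gauss(0.0, sd)


def nearest(theta_hat):
    return min(range(len(MUS)), key=lambda j: abs(theta_hat - MUS[j]))


def simulate(n_watchers, eps, reps=2000, seed=1):
    """Proportion of correct identifications for the mean and the median."""
    rng = random.Random(seed)
    hits = {"mean": 0, "median": 0}
    for _ in range(reps):
        j = rng.randrange(len(MUS))  # each BIRD sings with probability 1/5
        xs = [MUS[j] + draw_error(rng, eps) for _ in range(n_watchers)]
        hits["mean"] += nearest(statistics.fmean(xs)) == j
        hits["median"] += nearest(statistics.median(xs)) == j
    return {name: count / reps for name, count in hits.items()}


result = simulate(n_watchers=5, eps=0.25)
```

Calling `simulate` for 2 through 5 WATCHERS and several values of `eps` gives a rough feel for how the two procedures respond to contamination.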
In Table 1 proportions of correct identifications are given for a
simulation in which the BIRD with parameter $\mu_j$ sings with probability 1/5. The
number of WATCHERS varies between 2 and 5. Observation errors were
generated using the "true error distribution". Estimates of $\theta$ were computed
using both correct and incorrect assumptions given in the first column
concerning the error distribution. The Bayes estimates assumed the equally
likely prior and correct and incorrect assumptions given in the first column
about the error distribution.

Here are some conclusions that may be drawn from the simulations. The
BIRD estimates based on the mean and the normal Bayes model are the most
sensitive to incorrect error distribution assumptions; note that in the case
in which the true error distribution is $\epsilon$-contaminated normal with $\epsilon = 0.25$,
increasing the number of WATCHERS actually decreases the proportion of
correct identifications for these two procedures. This behavior results
from the fact that, with small numbers of WATCHERS, increasing the number of
WATCHERS increases the chances of having one or more outlying observations.
A more detailed explanation of this phenomenon can be found in the Appendix.
All of the procedures do about the same when the true error distribution is
normal. When the true error distribution is not normal, the Bayes estimates
based on the correct prior of equally likely BIRDS and error distribution
other than normal or Student t with 10 degrees of freedom tend to have the
highest proportion of correct identifications.

In Table 2 the proportions of correct identifications are given for a
simulation experiment in which the BIRD with parameter $\mu = 1$ always sings in
each replication; this parameter is on an extreme of the parameter set. In
Table 3, the proportions are given for an experiment in which the BIRD with
parameter $\mu = 3$ always sings; this parameter is in the middle of the
parameter set. The other parameters in the simulations remain the same as
those for the simulation in Table 1. In particular the Bayes estimation
procedures use the (incorrect) prior of equally likely BIRDS.

Comparing  Tables  1-3,  the  proportion  of  correct  identifications  is 
smallest  (respectively  largest)  in  the  case  in  which  BIRD  3  (respectively 
BIRD  1)  always  sings;  that  is,  the  position  of  the  parameter  of  the  singing 
BIRD  within  the  parameter  space  can  make  correct  identification  easier  or 
harder.  Once  again  procedures  based  on  the  normal  distribution  (mean  and 
normal  Bayes)  do  well  if  the  true  error  distribution  is  normal  but  tend  to 
have  smaller  proportions  of  correct  identifications  if  the  true  error 
distribution  is  not  normal;  this  decrease  in  the  proportion  of  correct 
identifications  is  greater  than  the  decrease  obtained  by  using  procedures 
based  on  non-normality  of  the  error  distribution  when  in  fact  the 
observations  have  normal  errors.  In  Tables  2  and  3  the  effect  of  using 
incorrect  prior  distributions  in  the  Bayes  models  has  been  to  make  their 
proportions  of  correct  identifications  closer  to  those  of  the  parametric 
procedures.  However,  the  Bayes  procedure  based  on  an  error  distribution 
Student-t  with  1  degree  of  freedom  appears  to  be  quite  robust  to  model 
assumptions  particularly  for  the  case  of  2  WATCHERS. 

4.2 BIRDS with Bivariate Parameters

In this subsection there are 5 BIRDS. Each BIRD has two parameters
associated with it. Two configurations of the BIRDS' parameters were
considered:

LINE: $\mu_1 = (1,1)$, $\mu_2 = (2,2)$, $\mu_3 = (3,3)$, $\mu_4 = (4,4)$, $\mu_5 = (5,5)$;

BOX: $\mu_1 = (2,2)$, $\mu_2 = (2,4)$, $\mu_3 = (3,3)$, $\mu_4 = (4,2)$, $\mu_5 = (4,4)$.

The observation errors have the following distributions:

1. $\epsilon$-contaminated bivariate normal with density function

$f(x,y) = (1-\epsilon) \ldots$

The parameters used are $\sigma_{11} = \sigma_{12} = 0.5$; $\sigma_{21} = \sigma_{22} =$ [value illegible];
$\rho_2 = \rho_1 = -0.5$, 0, 0.5, and $\epsilon = 0$, 0.1, and 0.25.

Note when $\epsilon = 0$, the error distribution is bivariate normal with $\sigma_1 = \sigma_2 =$
0.5 and $\rho = -0.5$, 0, 0.5.

2. A bivariate Student-t with density function [density illegible],
where $c$ is a constant and $d$ is the number of degrees of freedom. The
parameters used are $\sigma_1 = 0.5$, $\sigma_2 = 5.0$, $d = 1$, and $\rho = -0.5$, 0, 0.5.
Each simulation case has 2,000 replications. For each replication a
BIRD with parameter $\mu_j$ is drawn to sing with probability $p_j$. An independent
observational error is drawn for each WATCHER from one of the error
distributions. The WATCHER observations are combined by using one of the
procedures below to determine the BIRD that sang.

1.  Median:  Compute  the  medians  of  the  WATCHER  observations  of  the 
first  parameter  and  of  the  second  parameter.  Compute  the  Euclidean  distance 
of  the  median  pair  to  each  BIRD  parameter  pair.  The  BIRD  whose  parameter 
pair  has  the  smallest  distance  is  estimated  to  be  the  one  that  sang. 

2. MLE normal: The observational errors are assumed to have a bivariate normal distribution with parameters $\sigma_1 = 0.5$, $\sigma_2 = 0.5$ and correlation coefficient $\rho$. The likelihood is calculated for each BIRD parameter pair and the BIRD having the largest likelihood is estimated to be the BIRD that sang.

3. MLE CN: The observational errors are assumed to have an $\varepsilon$-contaminated normal distribution function with parameters $\sigma_{11} = \sigma_{12} = 0.5$ and $\sigma_{21} = \sigma_{22} = 5$, $\rho$ and $\varepsilon$. The procedure is the same as 2.

4. MLE T: The same as 2 and 3 except the observational errors are assumed to have a Student t distribution with parameters $\sigma_1 = \sigma_2 = 0.5$, $\rho$ and $d$.
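
The first two procedures above can be sketched as follows. This is an illustrative reading of the descriptions, not the code used in the study: the function names, the array layout (one row per WATCHER), and the use of the LINE configuration as example data are all assumptions. The MLE normal rule drops terms of the log-likelihood that are the same for every BIRD.

```python
import numpy as np

# LINE configuration of BIRD parameter pairs, one row per BIRD.
LINE = np.array([[1, 1], [2, 2], [3, 3], [4, 4], [5, 5]], dtype=float)


def identify_median(obs, birds):
    """Procedure 1: nearest BIRD (Euclidean) to the coordinatewise median.

    obs has shape (n_watchers, 2); returns the row index of the chosen BIRD.
    """
    center = np.median(obs, axis=0)
    dist = np.linalg.norm(birds - center, axis=1)
    return int(np.argmin(dist))


def identify_mle_normal(obs, birds, sigma=0.5, rho=0.5):
    """Procedure 2: BIRD with the largest bivariate-normal likelihood."""
    cov = sigma**2 * np.array([[1.0, rho], [rho, 1.0]])
    prec = np.linalg.inv(cov)
    # Residuals of every observation from every BIRD: shape (J, I, 2).
    resid = obs[None, :, :] - birds[:, None, :]
    # Sum of quadratic forms over WATCHERS, one value per BIRD.
    quad = np.einsum("jik,kl,jil->j", resid, prec, resid)
    # Log-likelihood up to an additive constant common to all BIRDS.
    return int(np.argmax(-0.5 * quad))
```

For example, three observations scattered around (3, 3) are assigned to the middle BIRD (row index 2) by both rules. Procedures 3 and 4 would follow the same pattern as `identify_mle_normal` with the normal log-density replaced by the contaminated-normal or Student t log-density.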

Table 4 shows the proportion of correct identifications for a case in which the BIRDS' parameters are in the LINE configuration. Each BIRD is equally likely to sing for each replication. The true error distributions simulated were the bivariate normal, the $\varepsilon$-contaminated normal with $\varepsilon = 0.1$, the $\varepsilon$-contaminated normal with $\varepsilon = 0.25$, and the Student t with 1 degree of freedom (Cauchy); they are listed in the first column of the table. The correlation coefficients of the simulated errors were $\rho = 0.5$, $0$, and $-0.5$ and are listed in the first row of the table. The number of WATCHERS varies between 2 and 5. The estimation procedures are listed in the second column of the table; the maximum likelihood procedures assumed $\rho = 0.5$.

The simulations of Table 5 used the same models and estimation procedures as those of Table 4 except that the maximum likelihood procedures assumed $\rho = -0.5$. A comparison of Tables 4 and 5 indicates that the value of the assumed $\rho$ for the maximum likelihood procedures made little difference in the proportion of correct identifications.

In both Tables a comparison of the proportion of correct identifications when the correlation coefficient of the true error distribution is $\rho = 0.5$ with those when $\rho = 0$ or $-0.5$ indicates that it is more difficult to identify the correct singing BIRD when the errors of observation for the BIRD's two parameters are positively correlated.
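
One way to see why positive correlation hurts in the LINE pattern is the following sketch. It is not part of the simulation study: it assumes purely normal errors ($\varepsilon = 0$) with $\sigma_1 = \sigma_2 = \sigma$, a Euclidean nearest-neighbor rule applied to the average $\bar{x}$ of the $I$ WATCHER observations, and it looks only at confusion between the singing BIRD $\mu_j$ and one adjacent BIRD $\mu_{j+1}$. The unit vector $u$ is introduced here for illustration.

```latex
% Adjacent LINE BIRDS differ by (1,1); let u = (1,1)/\sqrt{2}.
% The neighbor is preferred when the error of the average, projected on u,
% exceeds half the spacing, \sqrt{2}/2.
\begin{align*}
\operatorname{Var}\!\left(u^{\top}(\bar{x}-\mu_j)\right)
  &= \frac{1}{I}\cdot\frac{1}{2}\left(\sigma^2 + 2\rho\sigma^2 + \sigma^2\right)
   = \frac{\sigma^2(1+\rho)}{I}, \\
P\{\mu_{j+1}\ \text{preferred to}\ \mu_j\}
  &= 1 - \Phi\!\left(\frac{\sqrt{2}/2}{\sigma\sqrt{(1+\rho)/I}}\right).
\end{align*}
% With \sigma = 0.5 and I = 1 the argument of \Phi is about 1.15 for \rho = 0.5,
% about 1.41 for \rho = 0, and 2 for \rho = -0.5.
```

Under these assumptions the confusion probability increases with $\rho$, which agrees in direction with the ordering of the columns in Tables 4 and 5.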

A comparison of the proportion of correct identifications when the normal maximum likelihood method is used with those of the other methods indicates that the normal estimate is the most sensitive to incorrect assumptions concerning the error distribution. As was true in the univariate case, use of the normal estimate on data whose true error distribution has longer tails than normal can result in decreasing proportions of correct identifications as the number of WATCHERS increases.

Table 6 reports results of a simulation experiment with models and estimation procedures the same as in Table 4 except that the BIRD whose parameter is (3,3) always sings. Comparison of the results of Tables 4 and 6 indicates that the position of the singing BIRD in a pattern can affect the chances of a correct identification.

Table 7 reports results of a simulation experiment in which the parameters and estimates are the same as those of Table 4 except that the BIRDS' parameters are in the BOX pattern instead of the LINE pattern. A comparison of Tables 4 and 7 indicates that, if the correlation coefficient of the true error distribution is $\rho = 0.5$ (respectively $\rho = -0.5$), then it is easier (respectively harder) to make a correct identification of the singing BIRD with the BIRDS' parameters in the BOX pattern.

4.3  Conclusions  from  the  Simulation  Study 

1.  Estimation  and  identification  procedures  based  on  assumptions  of 
normal  errors  are  sensitive  to  outlying  observations. 

2. Estimation procedures based on assumptions of a long-tailed error distribution are more robust to incorrect error distribution assumptions than normal estimation procedures.

3.  Bayes  estimation  procedures  are  sensitive  to  incorrect 
specification  of  the  prior  distribution  of  which  BIRD  is  singing. 

4.  The  following  attributes  affect  the  ability  to  correctly  identify 
the  singing  BIRD. 

a.  If  each  BIRD  has  more  than  1  parameter,  correlation  between  the 
parameters'  observation  errors  can  influence  the  difficulty  of 
identification  of  the  correct  BIRD. 

b.  The  configuration  of  the  parameter  space  for  the  BIRDS  can  make 
correct  identification  more  difficult,  e.g.  LINE,  BOX. 

c. The location of the parameter of the singing BIRD in the parameter space can make correct identification easier or harder, e.g. middle BIRD or end BIRD in the univariate parameter case.

APPENDIX

HIT PROBABILITY WHEN BIRDS ARE ON A LINEAR LATTICE, ERRORS ARE ε-CONTAMINATED, AND A LINEAR SUMMARY, NEAREST-NEIGHBOR ALGORITHM IS USED: LINEAR CONSENSUS PROCEDURES NEED NOT SHOW "SAFETY IN NUMBERS".


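Suppose BIRD characteristics $\mu_j$ are at equal intervals: $\mu_j = 0, \pm 1, \pm 2, \ldots$ with no limit to number. Let the $i$-th of $I$ WATCHERS have the $\varepsilon$-contaminated error density

$$f(x_i;\theta) = \bar{\varepsilon}\,\frac{1}{\sqrt{2\pi}\,\sigma_{1i}}\exp\left[-\frac{1}{2}\left(\frac{x_i-\theta}{\sigma_{1i}}\right)^2\right] + \varepsilon\,\frac{1}{\sqrt{2\pi}\,\sigma_{2i}}\exp\left[-\frac{1}{2}\left(\frac{x_i-\theta}{\sigma_{2i}}\right)^2\right]$$

where $\varepsilon + \bar{\varepsilon} = 1$, $\varepsilon \ge 0$, $\bar{\varepsilon} \ge 0$. It is known that a BLUE of $\theta$ is

$$\hat{\theta}_{BLUE} = \frac{\sum_{i=1}^{I} x_i/\sigma_i^2}{\sum_{i=1}^{I} 1/\sigma_i^2} \tag{A-2}$$

where here

$$\sigma_i^2 = \bar{\varepsilon}\,\sigma_{1i}^2 + \varepsilon\,\sigma_{2i}^2 \tag{A-3}$$

and that

$$\operatorname{Var}[\hat{\theta}_{BLUE}] = \frac{1}{\sum_{i=1}^{I} 1/\sigma_i^2}. \tag{A-4}$$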

Clearly there is a tendency for the above variance to decrease with $I$, so one might conclude that adding more WATCHERS improves hit probability. This conclusion is false. Perhaps more surprisingly, existence of theoretical population moments does not seem to govern the behavior of the linear estimate. Of course, if $\sigma_i^2$ doesn't exist then the above weighting cannot be carried out, but if the error scale is the same for all WATCHERS then equal weighting is suggested. It can be seen analytically that the Student t with one d.f. (the Cauchy) error model implies that $\hat{\theta}_{BLUE}$ has exactly the same distribution regardless of the number of WATCHERS, and this effect is plainly visible from simulations and numerical calculations. On the other hand, the $\varepsilon$-contaminated Normal/Gauss error model has all moments finite, and yet can exhibit a hit probability that decreases with $I$, later of course increasing as it must, eventually, by central limit theorem effects. For what is possibly a plausible example, the advantage of number becomes evident only after about a dozen WATCHERS are performing simultaneously!
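
The Cauchy claim can be checked with characteristic functions. The following is a sketch under the assumption that the WATCHER errors $x_1,\dots,x_I$ are independent Cauchy variables with a common scale $\gamma$ (a symbol introduced here for illustration), that BIRD 0 sings, and that the equally weighted average $\bar{x}$ is used, so a hit occurs when $|\bar{x}| < 1/2$.

```latex
\begin{align*}
\varphi_{x_i}(t) &= E\,e^{itx_i} = e^{-\gamma|t|}, \\
\varphi_{\bar{x}}(t) &= \prod_{i=1}^{I}\varphi_{x_i}(t/I)
   = \left(e^{-\gamma|t|/I}\right)^{I} = e^{-\gamma|t|}, \\
P\{\mathrm{HIT}\} &= P\{|\bar{x}| < 1/2\}
   = \frac{2}{\pi}\arctan\!\left(\frac{1}{2\gamma}\right)
   \quad \text{for every } I.
\end{align*}
```

The average therefore has the same Cauchy distribution as a single observation; for instance, $\gamma = 0.5$ would give a hit probability of $0.5$ whatever the number of WATCHERS.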


Simplest  Case:  Statistically  Identical  WATCHERS 

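Let $\sigma_{1i} = \sigma_1$, $\sigma_{2i} = \sigma_2$, $\sigma_1^2 < \sigma_2^2$. To calculate hit probability, assume with no loss of generality that BIRD 0 is singing. Then the probability of a hit is the probability that the ordinary average of $I$ errors lies between $-\frac{1}{2}$ and $\frac{1}{2}$:

$$P\{\mathrm{HIT}\} = P\left\{-\frac{1}{2} < \frac{x_1 + x_2 + \cdots + x_I}{I} < \frac{1}{2}\right\}. \tag{A-5}$$

Condition on the error components involved: if $G$ represents the number of "good" (small variance, $\sigma_1^2$) observations, and $B = I - G$ the number of "bad" (large variance, $\sigma_2^2$), then $G \sim \mathrm{Binomial}(\bar{\varepsilon}, I)$ and, given $G$,

$$\frac{x_1 + x_2 + \cdots + x_I}{I} \sim N\left(0,\ \frac{G\sigma_1^2 + (I-G)\sigma_2^2}{I^2}\right), \tag{A-6}$$

$$P\{\mathrm{HIT} \mid G = g\} = 2\Phi\left(\frac{1/2}{\sqrt{\left(g(\sigma_1^2 - \sigma_2^2) + I\sigma_2^2\right)/I^2}}\right) - 1 \equiv \pi_g(I). \tag{A-7}$$

Consequently, when the condition is removed,

$$P\{\mathrm{HIT}\} = \sum_{g=0}^{I}\binom{I}{g}\,\bar{\varepsilon}^{\,g}\,\varepsilon^{\,I-g}\,\pi_g(I). \tag{A-8}$$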
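
The sum in (A-8) is straightforward to evaluate numerically. The sketch below is one possible implementation, with hypothetical function names; it assumes that $\varepsilon$ is the probability of a large-variance ("bad") observation, so that the number of good observations is binomial with success probability $1 - \varepsilon$.

```python
from math import comb, erf, sqrt


def normal_cdf(z):
    """Standard normal distribution function Phi(z)."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))


def hit_probability(n_watchers, eps, sigma1, sigma2):
    """Evaluate (A-8): hit probability of the averaged estimate with NN identification.

    eps is taken to be the probability that an observation has the large
    standard deviation sigma2; otherwise it has the small one, sigma1.
    """
    total = 0.0
    for g in range(n_watchers + 1):
        # Variance of the average given g good observations, as in (A-6).
        var_mean = (g * sigma1**2 + (n_watchers - g) * sigma2**2) / n_watchers**2
        # Conditional hit probability, as in (A-7).
        p_hit_given_g = 2.0 * normal_cdf(0.5 / sqrt(var_mean)) - 1.0
        # Binomial weight for g good observations out of n_watchers.
        weight = comb(n_watchers, g) * (1.0 - eps) ** g * eps ** (n_watchers - g)
        total += weight * p_hit_given_g
    return total
```

For example, `hit_probability(2, 0.25, 0.5, 5.0)` evaluates to about 0.54, and the value for four WATCHERS with the same parameters is smaller, illustrating that the hit probability of the average can fall as WATCHERS are added.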

Numerical Illustration

Suppose BIRDS occur as above and that WATCHERS are independent with $\varepsilon$-contaminated errors having parameters $\varepsilon = .25$, $\sigma_{1i} = 0.5$, $\sigma_{2i} = 10\,\sigma_{1i}$, so $\sigma_{2i} = 5$. Then the BLUE is the ordinary average, $\bar{X} = \hat{\theta}$, which is also a Bayes estimate if one were to assume that the error distribution is simple Normal and the prior probabilities equal. Adopt the NN approach (what else?) to identify the singing BIRD. Then we tabulate the

HIT PROBABILITIES

ALGORITHM            NUMBER OF WATCHERS, I
                     2     4     6     8     10    12    14
θ̂ = AVERAGE, NN      0.54  0.48  0.48  0.49  0.51  0.55  0.57
θ̂ = MEDIAN, NN       0.54  0.77  0.86  0.92  0.94  0.96  0.98

The effect mentioned is quite striking, with linear $\hat{\theta}$ hit probability dropping off quickly, recovering slowly, and not approaching that of the median until a value of $I$ much larger than any in our table is reached.
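
The comparison between the average and the median can also be explored by simulation. The following Monte Carlo sketch is not the computation behind the tabulated values; the function name, the number of replications, and the seed are arbitrary choices, and it assumes BIRD 0 sings, lattice spacing 1, and that each error is "bad" (standard deviation `sigma2`) with probability `eps`.

```python
import numpy as np


def simulate_hit_rates(n_watchers, eps=0.25, sigma1=0.5, sigma2=5.0,
                       reps=20000, seed=0):
    """Estimate hit probabilities of the average and the median with NN identification."""
    rng = np.random.default_rng(seed)
    # Each observation independently gets the large scale with probability eps.
    is_bad = rng.random((reps, n_watchers)) < eps
    scale = np.where(is_bad, sigma2, sigma1)
    errors = rng.normal(0.0, 1.0, size=(reps, n_watchers)) * scale
    # With BIRD 0 singing, a hit means the estimate falls within 1/2 of zero.
    hit_mean = np.mean(np.abs(errors.mean(axis=1)) < 0.5)
    hit_median = np.mean(np.abs(np.median(errors, axis=1)) < 0.5)
    return float(hit_mean), float(hit_median)
```

Calling `simulate_hit_rates(10)` returns a pair (average, median) of estimated hit probabilities; the median's estimate should be markedly higher, in line with the pattern described above.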

Table 1

Proportion of Correct Identifications
BIRDS Equally Likely

                    True Error Distribution
                    Normal               Student-t            Contaminated Normal
                    σ = .5               1 df                 ε = 0.1              ε = 0.25
Number of WATCHERS: 2    3    4    5     2    3    4    5     2    3    4    5     2    3    4    5

Parameter Estimate
Mean                .88  .93  .96  .98   .60  .60  .61  .60   .77  .78  .79  .77   .64  .61  .59  .58
Median              .88  .89  .95  .95   .60  .75  .78  .83   .77  .85  .91  .92   .64  .78  .81  .86
Iter. T 1df         .88  .91  .95  .97   .61  .76  .80  .85   .77  .88  .92  .92   .64  .81  .85  .90
Iter. T 3df         .88  .93  .96  .98   .61  .76  .80  .84   .77  .88  .92  .95   .65  .80  .84  .89
Iter. CN ε=0.1      .88  .93  .96  .98   .60  .74  .78  .82   .77  .89  .93  .96   .64  .82  .86  .91
Iter. CN ε=0.25     .88  .93  .96  .98   .60  .75  .78  .83   .77  .89  .93  .96   .64  .82  .86  .91

Bayes Estimates
Normal              .88  .93  .96  .98   .60  .59  .60  .60   .77  .78  .78  .77   .64  .60  .58  .57
T 1df               .86  .92  .95  .97   .68  .77  .82  .86   .81  .88  .93  .95   .75  .82  .87  .91
T 10df              .88  .93  .96  .98   .65  .74  .79  .83   .80  .88  .92  .94   .70  .78  .83  .87
CN ε=0.1            .88  .93  .96  .98   .67  .75  .79  .83   .82  .90  .94  .96   .75  .83  .88  .92
CN ε=0.25           .87  .95  .96  .98   .67  .75  .79  .83   .82  .89  .94  .96   .75  .83  .88  .92

Table 2

Proportion of Correct Identifications
BIRD with PARAMETER 1 SINGS

                    True Error Distribution
                    Normal               Student-t            Contaminated Normal
                    σ = 0.5              1 df                 ε = 0.1              ε = 0.25
Number of WATCHERS: 2    3    4    5     2    3    4    5     2    3    4    5     2    3    4    5

Parameter Estimate
Mean                .92  .96  .98  .99   .75  .75  .75  .74   .85  .86  .86  .86   .77  .75  .74  .75
Median              .92  .93  .97  .97   .75  .85  .66  .89   .85  .91  .94  .95   .77  .86  .89  .92
Iter. T 1df         .92  .94  .97  .98   .74  .85  .86  .90   .85  .92  .95  .96   .77  --   --   .93
Iter. T 3df         .92  .95  .97  .98   .74  .84  .86  .89   .85  .92  .95  .96   .76  .87  .89  .93
Iter. CN ε=0.1      .92  .96  .98  .99   .75  .84  .86  .89   .85  .93  .96  .97   .77  .89  .92  .95
Iter. CN ε=0.25     .92  .95  .97  .98   .75  .84  .87  .90   .85  .93  .96  .97   .77  .89  .91  .95

Bayes Estimates
Normal              .92  .96  .98  .99   .77  .79  .81  .82   .85  .87  .87  .87   .78  .76  .76  --
T 1df               .91  .95  .97  .98   .78  .85  .88  .91   .87  .93  .96  .97   .82  .88  .92  .94
T 10df              .92  .96  .98  .99   .77  .84  .87  .89   .86  .92  .95  .96   .80  .86  .89  .92
CN ε=0.1            .92  .96  .98  .99   .76  .82  .87  .89   .87  .93  .96  .97   .82  .89  .92  .95
CN ε=0.25           .92  .95  .97  .98   .76  .82  .87  .89   .87  .93  .96  .97   .82  .89  .92  .95

-- : entry illegible in the source.


Table 3

Proportion of Correct Identifications
BIRD with PARAMETER 3 SINGS

                    True Error Distribution
                    Normal               Student-t            Contaminated Normal
                    σ = 0.5              1 df                 ε = 0.1              ε = 0.25
Number of WATCHERS: 2    3    4    5     2    3    4    5     2    3    4    5     2    3    4    5

Parameter Estimate
Mean                .84  .91  .96  .97   .49  .50  .50  .50   .71  .72  .72  .72   .54  .51  .49  .48
Median              .84  .86  .93  .94   .49  .69  .72  .79   .71  .81  .88  .90   .54  .73  .77  .83
Iter. T 1df         .84  .89  .94  .96   .51  .70  .76  .81   .71  .84  .90  .93   .55  .76  .81  .87
Iter. T 3df         .84  .90  .95  .97   .51  .70  .74  .81   .71  .85  .90  .93   .56  .75  .80  .87
Iter. CN ε=0.1      .84  .91  .95  .97   .49  .68  .73  .78   .71  .86  .92  .95   .56  .77  .82  .89
Iter. CN ε=0.25     .84  .90  .95  .97   .49  .69  .73  .79   .71  .86  .91  .95   .54  .77  .82  .89

Bayes Estimates
Normal              .84  .91  .96  .97   .49  .50  .50  .49   .71  .72  .72  .72   .54  .51  .48  .48
T 1df               .82  .89  .94  .96   .62  .71  .78  .82   .77  .85  .91  .94   .69  .78  .84  .89
T 10df              .84  .91  .95  .98   .56  .68  .73  .79   .75  .83  .89  .93   .63  .72  .79  .85
CN ε=0.1            .84  .91  .95  .97   .49  .68  .73  .78   .71  .86  .92  .95   .54  .77  .82  .89
CN ε=0.25           .84  .90  .95  .97   .49  .69  .73  .79   .71  .86  .91  .95   .54  .77  .82  .89

Table 4

LINE Pattern
Equally Likely BIRDS
Proportion of Correct Identifications

Assumed RHO = 0.5

                          CORRELATION COEFFICIENT USED TO GENERATE THE ERROR
                          RHO = 0.5            RHO = 0              RHO = -0.5
Number of WATCHERS:       2    3    4    5     2    3    4    5     2    3    4    5

True Error
Dist.       Estimate
CN(ε=0)     MEDIAN        .92  .94  .98  .98   .96  .97  .99  .99   1    .99  1    1
            NORMMLE       .92  .96  .99  .98   .96  .99  .99  1     1    1    1    1
            TMLE 1df      .90  .95  .98  .99   .94  .98  .99  1     1    1    1    1
            TMLE 3df      .92  .95  .98  .99   .96  .98  .99  1     1    1    1    1
            TMLE 10df     .92  .95  .98  .99   .96  .99  .99  1     1    1    1    1
            CNMLE(ε=0.1)  .92  .96  .99  .99   .96  .99  .99  1     1    1    1    1
            CNMLE(ε=0.25) .92  .96  .99  .99   .96  .99  .99  1     1    1    1    1

CN(ε=0.1)   MEDIAN        .82  .91  .95  .96   .85  .93  .96  .98   .87  .97  .97  .99
            NORMMLE       .82  .81  .83  .84   .85  .84  .83  .83   .87  .85  .84  .82
            TMLE 1df      .87  .94  .96  .97   .92  .97  .98  1     .97  .99  1    1
            TMLE 3df      .88  .94  .97  .98   .93  .97  .98  .99   .97  .99  1    1
            TMLE 10df     .88  .94  .97  .97   .92  .96  .98  .99   .95  .99  .99  1
            CNMLE(ε=0.1)  .89  .94  .97  .98   .93  .97  .99  1     .97  .99  1    1
            CNMLE(ε=0.25) .89  .94  .97  .98   .93  .97  .99  1     .97  .99  1    1

CN(ε=0.25)  MEDIAN        .68  .85  .86  .92   .70  .87  .88  .93   .73  .88  .89  .94
            NORMMLE       .68  .67  .65  .68   .70  .67  .67  .65   .73  .68  .67  .65
            TMLE 1df      .82  .91  .93  .97   .86  .94  .97  .98   .93  .96  .98  .99
            TMLE 3df      .82  .91  .93  .96   .86  .94  .96  .98   .92  .95  .98  .99
            TMLE 10df     .80  .89  .92  .96   .85  .91  .94  .96   .89  .92  .96  .98
            CNMLE(ε=0.1)  .83  .92  .94  .97   .87  .94  .97  .98   .93  .97  .98  .99
            CNMLE(ε=0.25) .83  .92  .94  .97   .87  .95  .97  .98   .93  .97  .98  .99

T 1 d.f.    MEDIAN        .63  .79  .82  .87   .69  .83  .86  .91   .77  .88  .90  .93
            NORMMLE       .63  .63  .65  .64   .69  .68  .70  .69   .77  .77  .74  .76
            TMLE 1df      .78  .84  .89  .91   .80  .89  .94  .95   .88  .94  .97  .98
            TMLE 3df      .76  .84  .88  .91   .80  .89  .94  .94   .88  .94  .96  .98
            TMLE 10df     .75  .80  .85  .89   .79  .87  .92  .94   .87  .93  .95  .97
            CNMLE(ε=0.1)  .76  .81  .86  .89   .80  .87  .92  .94   .87  .92  .95  .97
            CNMLE(ε=0.25) .72  .82  .87  .89   .79  .87  .92  .94   .87  .92  .96  .97


Table 5

LINE Pattern
Equally Likely BIRDS
Proportion of Correct Identifications

Assumed RHO = -0.5

                          CORRELATION COEFFICIENT USED TO GENERATE THE ERROR
                          RHO = 0.5            RHO = 0              RHO = -0.5
Number of WATCHERS:       2    3    4    5     2    3    4    5     2    3    4    5

True Error
Dist.       Estimate
CN(ε=0)     MEDIAN        .92  .95  .97  .98   .96  .97  .99  .99   1    1    1    1
            NORMMLE       .92  .97  .98  .99   .96  .99  .99  1     1    1    1    1
            TMLE 1df      .89  .95  .97  .99   .95  .98  1    1     .99  1    1    1
            TMLE 3df      .91  .96  .98  .99   .96  .98  1    1     .99  1    1    1
            TMLE 10df     .91  .96  .98  .99   .96  .99  .99  1     1    1    1    1
            CNMLE(ε=0.1)  .89  .94  .97  .99   .95  .98  1    1     1    1    1    1
            CNMLE(ε=0.25) .88  .94  .96  .98   .94  .98  .99  1     .99  1    1    1

CN(ε=0.1)   MEDIAN        .82  .90  .95  .96   .86  .94  .96  .98   .87  .97  .98  .99
            NORMMLE       .82  .83  .82  .83   .86  .85  .83  .84   .87  .87  .84  .83
            TMLE 1df      .88  .92  .96  .98   .93  .97  .98  .99   .98  .99  1    1
            TMLE 3df      .89  .93  .96  .98   .93  .97  .99  1     .98  .99  1    1
            TMLE 10df     .89  .93  .96  .98   .93  .97  .98  .99   .97  .99  .99  1
            CNMLE(ε=0.1)  .87  .92  .96  .97   .93  .97  .99  .99   .98  1    1    1
            CNMLE(ε=0.25) .87  .91  .95  .97   .93  .97  .98  .99   .98  1    1    1

CN(ε=0.25)  MEDIAN        .69  .82  .85  .92   .71  .86  .87  .94   .72  .89  .88  .95
            NORMMLE       .69  .68  .64  .66   .71  .68  .66  .66   .72  .67  .66  .66
            TMLE 1df      .82  .88  .92  .95   .87  .93  .97  .98   .93  .97  .99  .99
            TMLE 3df      .83  .88  .92  .95   .87  .93  .97  .98   .92  .97  .99  .99
            TMLE 10df     .83  .87  .92  .96   .87  .92  .96  .97   .90  .95  .98  1
            CNMLE(ε=0.1)  .82  .88  .92  .96   .87  .94  .97  .98   .93  .98  .99  1
            CNMLE(ε=0.25) .81  .88  .91  .95   .87  .94  .97  .98   .93  .98  .99  1

T 1 d.f.    MEDIAN        .63  .79  .82  .87   .68  .83  .85  .92   .76  .89  .91  .94
            NORMMLE       .63  .64  .64  .63   .68  .69  .70  .70   .76  .76  .78  .77
            TMLE 1df      .76  .81  .87  .91   .81  .89  .93  .96   .89  .94  .97  .98
            TMLE 3df      .76  .82  .88  .91   .81  .89  .93  .96   .89  .94  .97  .98
            TMLE 10df     .75  .81  .87  .90   .79  .87  .92  .95   .88  .93  .97  .98
            CNMLE(ε=0.1)  .74  .80  .85  .88   .79  .88  .91  .95   .88  .93  .97  .98
            CNMLE(ε=0.25) .74  .80  .85  .88   .79  .88  .91  .94   .88  .93  .97  .97


Table 6

LINE Pattern
BIRD (3,3) Always Sings
Proportion of Correct Identifications

Assumed RHO = 0.5

                          CORRELATION COEFFICIENT USED TO GENERATE THE ERROR
                          RHO = 0.5            RHO = 0              RHO = -0.5
Number of WATCHERS:       2    3    4    5     2    3    4    5     2    3    4    5

True Error
Dist.       Estimate
CN(ε=0)     MEDIAN        .90  .93  .97  .98   .95  .97  .99  .99   1    .99  1    1
            NORMMLE       .90  .96  .98  .99   .95  .99  .99  1     1    1    1    1
            TMLE 1df      .87  .94  .97  .99   .94  .98  .99  .99   .99  1    1    1
            TMLE 3df      .89  .95  .98  .99   .95  .99  .99  1     .99  1    1    1
            TMLE 10df     .89  .96  .98  .99   .95  .99  .99  1     .99  1    1    1
            CNMLE(ε=0.1)  .90  .96  .98  .99   .95  .99  .99  1     .99  1    1    1
            CNMLE(ε=0.25) .89  .96  .98  .99   .95  .98  .99  1     .99  1    1    1

CN(ε=0.1)   MEDIAN        .75  .88  .94  .95   .81  .93  .96  .97   .85  .96  .96  .99
            NORMMLE       .75  .78  .78  .77   .81  .80  .79  .80   .85  .81  .79  .80
            TMLE 1df      .84  .92  .96  .97   .90  .96  .98  .99   .97  .99  1    1
            TMLE 3df      .85  .92  .96  .98   .90  .97  .98  .99   .97  .98  1    1
            TMLE 10df     .84  .91  .96  .98   .89  .96  .98  .99   .95  .97  .99  1
            CNMLE(ε=0.1)  .85  .92  .96  .98   .91  .97  .98  .99   .97  .99  1    1
            CNMLE(ε=0.25) .85  .92  .96  .98   .91  .97  .98  .99   .97  .99  1    1

CN(ε=0.25)  MEDIAN        .59  .81  .83  .90   .62  .82  .85  .93   .66  .85  .87  .93
            NORMMLE       .59  .57  .58  .57   .62  .59  .57  .58   .66  .59  .59  .57
            TMLE 1df      .78  .86  .92  .95   .84  .91  .96  .98   .89  .95  .98  .99
            TMLE 3df      .78  .87  .93  .95   .83  .91  .95  .98   .89  .94  .97  .99
            TMLE 10df     .76  .85  .91  .94   .80  .89  .94  .96   .85  .92  .96  .97
            CNMLE(ε=0.1)  .79  .88  .93  .96   .84  .93  .96  .98   .90  .95  .98  .99
            CNMLE(ε=0.25) .79  .88  .93  .96   .84  .93  .97  .98   .90  .96  .98  .99

T 1 d.f.    MEDIAN        .53  .74  .76  .85   .61  .80  .81  .89   .69  .84  .87  .93
            NORMMLE       .53  .54  .54  .56   .61  .63  .61  .62   .69  .69  .71  .73
            TMLE 1df      .71  .81  .85  .90   .76  .86  .90  .94   .84  .91  .95  .98
            TMLE 3df      .70  .80  .85  .90   .75  .86  .90  .94   .84  .91  .95  .98
            TMLE 10df     .68  .78  .83  .89   .73  .85  .89  .93   .83  .89  .95  .98
            CNMLE(ε=0.1)  .70  .79  .83  .87   .75  .84  .88  .92   .83  .89  .93  .97
            CNMLE(ε=0.25) .70  .79  .83  .88   .75  .84  .89  .93   .83  .89  .93  .97

Table 7

BOX Pattern
Equally Likely BIRDS
Proportion of Correct Identifications

Assumed RHO = 0.5

                          CORRELATION COEFFICIENT USED TO GENERATE THE ERROR
                          RHO = 0.5            RHO = 0              RHO = -0.5
Number of WATCHERS:       2    3    4    5     2    3    4    5     2    3    4    5

True Error
Dist.       Estimate
CN(ε=0)     MEDIAN        .96  .97  .98  .99   .97  .97  .99  .99   .96  .96  .99  .95
            NORMMLE       .96  .98  .99  1     .96  .99  1    1     .94  .98  .99  1
            TMLE 1df      .96  .98  .99  .99   .96  .98  .99  1     .93  .97  .98  1
            TMLE 3df      .96  .98  .99  .99   .96  .98  .99  1     .94  .97  .99  1
            TMLE 10df     .96  .98  .99  1     .96  .99  1    1     .95  .98  .99  1
            CNMLE(ε=0.1)  .96  .98  .99  1     .96  .98  1    1     .93  .96  .98  .99
            CNMLE(ε=0.25) .96  .98  .99  1     .95  .98  .99  1     .92  .96  .98  .99

CN(ε=0.1)   MEDIAN        .83  .93  .96  .98   .84  .94  .96  .98   .84  .93  .96  .97
            NORMMLE       .83  .81  .82  .79   .84  .83  .81  .82   .83  .81  .82  .79
            TMLE 1df      .92  .96  .98  .99   .92  .97  .99  1     .90  .95  .97  .98
            TMLE 3df      .92  .97  .98  .99   .93  .97  .99  .99   .90  .95  .98  .98
            TMLE 10df     .90  .96  .98  .99   .91  .97  .98  .99   .90  .94  .98  .98
            CNMLE(ε=0.1)  .93  .97  .99  .99   .93  .97  .99  .99   .90  .94  .97  .98
            CNMLE(ε=0.25) .93  .97  .98  .99   .93  .97  .99  .99   .90  .94  .97  .98

CN(ε=0.25)  MEDIAN        .70  .87  .87  .92   .70  .85  .88  .93   .71  .87  .87  .93
            NORMMLE       .69  .63  .63  .61   .69  .64  .62  .62   .70  .64  .60  .61
            TMLE 1df      .87  .93  .96  .98   .87  .93  .96  .97   .84  .91  .94  .97
            TMLE 3df      .87  .93  .96  .97   .87  .93  .95  .97   .84  .91  .94  .97
            TMLE 10df     .85  .91  .95  .96   .85  .90  .94  .96   .83  .90  .93  .95
            CNMLE(ε=0.1)  .88  .94  .96  .98   .87  .93  .96  .98   .84  .91  .94  .97
            CNMLE(ε=0.25) .88  .94  .97  .98   .87  .93  .96  .98   .84  .90  .94  .97

T 1 d.f.    MEDIAN        .66  .83  .86  .91   .67  .83  .86  .91   .66  .83  .85  .91
            NORMMLE       .67  .71  .69  .69   .66  .67  .66  .66   .63  .63  .63  .63
            TMLE 1df      .81  .88  .92  .95   .80  .87  .92  .95   .77  .86  .90  .93
            TMLE 3df      .81  .88  .93  .95   .80  .87  .92  .95   .77  .85  .90  .92
            TMLE 10df     .79  .87  .92  .94   .77  .86  .90  .94   .75  .84  .89  .91
            CNMLE(ε=0.1)  .79  .87  .91  .94   .78  .85  .90  .93   .76  .85  .88  .91
            CNMLE(ε=0.25) .80  .87  .91  .94   .75  .85  .90  .93   .75  .84  .88  .91